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PSADBW Instruction (Prior Art) 
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abs(X7-Y7) + abs(X6-Y6) + abs(X5-Y5) + abs(X4-Y4) + abs(X3-Y3) + abs(X2-Y2) + abs(X1-Y1) + abs(XO-YO) 



TEMPO <== ABS(XO-YO) 
TEMPI <==ABS(X1-Y1) 
TEMP2<==ABS(X2-Y2) 
TEMP3<==ABS(X3-Y3) 
TEMP4<==ABS(X4-Y4) 
TEMPS <==ABS(X5-Y5) 
TEMP6<==ABS(X6-Y6) 
TEMP7 <==ABS(X7-Y7) 



RESULT <== SUM(TEMPO, TEMPI, TEMP2, TEMP3, TEMP4, TEMPS, TEMP6, TEMP7) 



fig. 2 



Microprocessor 



r 



200 



PSADBW macroinstruction 100 



instruction translation logic 202 I 



PMULSAD 212 



PSUBSAD 214 



^204 microinstruction queue 



+ 



MMXunit m. 



Fig. 3 



'104 minuend operand 



X7 


X6 


X5 


X4 


X3 


X2 


X1 


xo| 













2/3 
MMXUnit 

microinstr 306 I 



+ 



206 



-106 subtrahend operand 



I 



Y7 


Y6 


Y5 


Y4 


Y3 


Y2 


Y1 


Y0| 







f 



^312 difference cany bits 



C7 



C6 



+ 



C5 



C4 



carry-generating packed subtraction logic ^ 

I 



C3 



C2 



C1 



CO^ 



X7-Y7 



316 byte inverter 



X6-Y6 



X5-Y5 



1 / 



314 pacl(ed differences 



X4-Y4 



X3-Y3 



1 

18/^ 



X2-Y2 



X1-Y1 



XO-YO 



Z7 



Z6 



Z5 



0/ o/ 

318^^p^ 318^^p^ 



Z4 



324 microinstr type: 1 = PMULSAD, 0 = multiply instr 
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PSADBW result: sum{abs(X7-Y7) ... abs(XO-YO)) 108 I 
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Generation of Packed Sum of Absolute Differences 
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execute PSUBSAD microinstruction: subtracters subtract 
subtrahend from the minuend to generate eight packed differences 
and their associated can7 bits; 
a 1 carry bit indicates a negative difference; 
a 0 can7 bit indicates a positive difference 
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selectively invert each of the eight differences if its con-esponding 
carry bit is negative 
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adders add the multiplier partial 
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execute the PMULSAD instruction: adders add the eight selectively 
inverted differences and the eight carry bits to generate the 
PSADBW result 
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